AD-A096 965  FLORIDA UNIV GAINESVILLE CENTER FOR MATHEMATICAL SYS-- ETC  F/G 5/3
IDENTIFIABILITY AND PROBLEMS OF MODEL SELECTION FOR TIME-SERIES-- ETC(U)
1980  R E KALMAN  AFOSR-76-3034
AFOSR-TR-81-0229  NL


UNCLASSIFIED

REPORT DOCUMENTATION PAGE

4.  TITLE (and Subtitle)
    IDENTIFIABILITY AND PROBLEMS OF MODEL SELECTION
    FOR TIME-SERIES ANALYSIS IN ECONOMETRICS

7.  AUTHOR(s)
    R. E. Kalman

8.  CONTRACT OR GRANT NUMBER(s)
    AFOSR-76-3034

9.  PERFORMING ORGANIZATION NAME AND ADDRESS
    Center for Mathematical System Theory
    University of Florida
    Gainesville, FL 32611

11. CONTROLLING OFFICE NAME AND ADDRESS
    Air Force Office of Scientific Research /NM
    Bolling AFB, DC 20332

15. SECURITY CLASS. (of this report)
    UNCLASSIFIED

16. DISTRIBUTION STATEMENT (of this Report)
    Approved for public release; distribution unlimited.

18. SUPPLEMENTARY NOTES
    Presented as an invited lecture at the Fourth World Congress of
    the Econometric Society, Aix-en-Provence, France, August 30, 1980

19. KEY WORDS (Continue on reverse side if necessary and identify by block number)
    Identification, simultaneous model selection

20. ABSTRACT (Continue on reverse side if necessary and identify by block number)
    Identification and model selection in econometrics are examined from a
    system-theoretic point of view. A critical study of the existing techniques in
    econometrics is carried out through several examples from the literature.


AFOSR-TR-81-0229

Presented as an invited lecture at the Fourth World Congress of the
Econometric Society, Aix-en-Provence, FRANCE, August 30, 1980.


IDENTIFIABILITY AND PROBLEMS OF MODEL SELECTION FOR
TIME-SERIES ANALYSIS IN ECONOMETRICS*

by

R. E. Kalman

Swiss Federal Institute of Technology, Zürich, SWITZERLAND
University of Florida, Gainesville, FL USA




* This research was supported in part under US Air Force Grant AFOSR
76-3034 and US Army Research Grant DAA 29-77-0-0225 through the
Center for Mathematical System Theory, University of Florida,
Gainesville, FL 32611 USA.



REKADC  Page 1
Rev. 3, 01/13/81


1.  BACKGROUND AND PERSPECTIVE

Econometrics  was  born  in  the  1920's  from  the  hope  that  it  would  be 
the  embodiment  of  a  noble  dream.  Today,  with  the  hindsights  of  more  than 
fifty  years,  this  dream  may  be  described  precisely,  as  follows: 

"Economics  deals  with  complex  phenomena  which  make  it  impossible 
to  isolate  quantitative  relationships  between  important  variables  (say, 
taxations  vs.  savings),  nor  is  it  possible  to  perform  experiments  or  direct 
observations  that  would  isolate  these  relationships  or  at  least  diminish 
the  noise  level  under  which  they  are  observable.  We  possess,  however, 
innumerable  time  series  generated  by  the  economic  "forces"  we  wish  to 
discover.  By  constructing  models  for  economic  time  series  we  may  hope  to 
obtain  indirect  access  to  the  desired  quantitative  economic  relationships 
since  these  are  "encoded"  in  the  models  and  therefore  must  be  recoverable 
from  the  models'  structure  and  parameters.  Moreover,  since  economic  truths 
are  immutable,  at  least  in  the  short  run,  there  is  no  reason,  in  principle, 
why  the  models  cannot  be  accurately  determined,  in  spite  of  disturbances, 
errors,  irrationalities,  expectations,  and  other  random  influences  which 
afflict  the  available  time  series." 

My personal analysis of the preceding scenario is that it forces the
study of economics to be a system-theoretic endeavor. Economics as a science
requires methods which are effective in the exploration and explanation of
interactive phenomena. Thus, implementing the dream of econometrics becomes
a problem in system theory, a discipline which was not even in sight in the
1920's.

But, unfortunately, the actual evolution of econometrics took a different
route and came to be dominated by statistics; the result is that today
econometrics has drifted apart from the scientific mainstream of the 1950's and
1960's.

During this period system theory has reached a certain level of maturity.
It is now beginning to provide the scientific framework within which basic
ideas of economics and econometrics can be reexamined and subjected to deep
analysis and critique. Some of its current beliefs, procedures, and even
results must be changed if econometrics is to continue to play its role on
a world scene where massive computer analysis and advanced mathematical
methods of modeling compete with it in attacking the same basic problems.

As I have pointed out in several previous publications (see KALMAN
[1979a, 1979b, 1980a, 1980b]), research in modeling cannot be done
meaningfully if it is viewed as a problem which is inseparable from a particular
field of application. Habits of economic modeling cannot be justified solely
by economic reasoning. Modeling has its own logic, independently of what is
being modeled. And the questions raised by modeling logic may be far more
important than practical problems such as reliability of data, use of prior
economic theory, statistical methodology, and the like.

The papers just quoted are concerned with general, even philosophical,
questions of modeling. In this paper we shall focus on a specific question:
the concept of identifiability.

Econometricians have become accustomed to formalize the process of
modeling by asking whether or not the parameters describing a model may be deduced
as a function of (i.e., uniquely determined by) the "information" or "data"
which is available for the construction of the model. If this requirement
is met, the parameters are said to be identifiable.

Such a notion of identifiability is unexceptional on purely conceptual
grounds. But this is not enough. The concept of "parameter identifiability"
must also make system-theoretic sense. Whether this is so or not can be
objectively decided because the notion of a model is a precisely defined
concept in system theory.

When "identifiability" is critically examined, several important new
questions arise which have not been considered by econometricians.

First, it is nontrivial whether or not the model is well-defined in the
mathematical sense. There are many instances in the econometrics literature
where this is just not the case. System theory provides criteria for
discarding models which are not well-defined.

Second, assuming now that the model is indeed well-defined, it follows,
essentially as a matter of definition, that the model or models compatible
with the data is or are necessarily identifiable in the abstract sense.

The  difficulty  is  nonuniqueness,  not  identifiability. 

Third, there is the question of parametrization of models. In econometrics,
"parameters" are used in the descriptive sense; in other words,
simply to give a mathematical specification of the model. The trouble is
that a model is an abstract mathematical object. It is subject to various
assumptions and restrictions in addition to its describing parameters.
Therefore its intrinsic parametrization, which reflects its precise
mathematical attributes, is usually quite different from the naive (descriptive)
parametrization. The problem of parametrization of a model, as of any
abstract mathematical object, is a highly nontrivial mathematical problem
and is not accessible to the kind of elementary reasoning that has been used
in econometrics or time-series analysis. The confusion between descriptive
and intrinsic parameters is deep-seated and it is not easy to clarify it on
the intuitive level.

In short, "parameter identifiability" as a scientific concept is of no
utility. Conceptually, it is just not a good tool for probing deeper
properties of models. The basic tasks are to study models which arise in
time-series identification problems and to do the concomitant mathematical research
concerning their invariants, local coordinates, or intrinsic parameters,
all three terms being synonymous.

The issues involved will be illustrated in Sections 3 through 5 by simple
examples taken from the econometric literature. We shall stress the
conceptual rather than the technical aspects. A critical and mathematical treatment,
drawing upon a wider range of the econometric literature, will be given in
KALMAN [1981].

The contribution of this paper, it must be emphasized, does not reside
in proposing a new methodology for econometrics. Rather, the contribution
consists in pointing out serious intrinsic limitations of the existing
methodological state of affairs in the light of superior knowledge already
available in system theory. The required changes in econometrics are
fundamental, important, and not just a symptom of disagreement between
researchers from different fields or having different objectives.

Some  may  find  these  changes  not  to  their  liking.  I  am  sorry  but  as  a 
scientist  it  is  my  duty  to  call  attention  to  the  reasons  which  mandate 
change.  If  econometricians  ignore  the  very  real  difficulties  discussed 
here,  their  discipline  will  become  sterile  and  effete,  like  a  graying  man 
clinging  to  a  dream,  perhaps  even  a  love,  of  his  early  youth,  still  hoping 
he  can  continue  to  work  as  he  always  did,  without  taking  notice  of  the  many 
wonderful  things  that  were  born  into  the  world  after  the  conception  of  his 
dream. 

2.  THE  MODEL  AS  A  LINEAR  DYNAMICAL  SYSTEM 

Econometric  modeling  from  time  series  requires  system  theory  at  its 
very  first  step:  the  definition  of  the  model.  This  may  seem  a  triviality 
but  turns  out  to  be  crucial. 

Most  of  the  time-series  literature  is  concerned  only  with  linear  models. 
Therefore  we  shall  restrict  our  attention  (here)  to  the  corresponding  concept 
of  a  linear  system.  This  concept  has  been  axiomatized  (see  KALMAN,  FALB, 
and ARBIB [1969, Chapters 1, 2, and 10]) and provides a reference point to
which  all  further  definitions  and  results  can  be  compared.  It  also  provides 
a  means  whereby  the  current  status  and  claims  of  time-series  modeling  may 
be  scientifically  assessed,  which  is  what  is  done  in  KALMAN  [1981]. 

In the definition of a system, precise technical meaning must be given
to the attributes linear, finite (-dimensional and finitely parametrized),
multi-input/multi-output, constant (= time-independent in its structural
properties), and dynamical. These words are all incorporated in the
standard definition* which comes in two versions. For continuous-time, that is,
with the time set T = R = real numbers, a system $\Sigma$ is defined by

(2.1)  $dx/dt = Fx + Gu(t), \quad y(t) = Hx(t), \quad t \in R;$

for discrete-time, that is, with the time set T = Z = integers, a system
$\Sigma$ is given by

(2.2)  $x(t + 1) = Fx(t) + Gu(t), \quad y(t) = Hx(t), \quad t \in Z.$

* The notations in (2.1) and (2.2), which I introduced around 1960 to
honor my great teacher, F. G. H. Linear, have been universally adopted.

In (2.1-2.2), the real (or complex) vectors x, u, and y are called
state, input, and output, respectively; F, G, H are matrices with constant
real (or complex) coefficients.

It is rather obvious from its definition that a "system" (which will be
our shorthand for the precise terminology concomitant with (2.1-2)) is really
defined by the "data" (F, G, H). So we frequently write simply
$\Sigma$ = (F, G, H). The notations have been intentionally selected in such a way
that there is no built-in distinction between the continuous-time and
discrete-time cases. Most of the system-theoretic questions are purely algebraic
in nature, based on properties of F, G, and H, and therefore such a
distinction is not necessary; the results hold simultaneously for continuous-time
and discrete-time.
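
As a concrete illustration of the discrete-time definition (2.2), the following minimal Python sketch iterates the two equations for a small system. The matrices F, G, H, the input sequence, and the function names are hypothetical choices made only for this illustration; none of them comes from the text.

```python
from typing import List

Vector = List[float]
Matrix = List[List[float]]


def mat_vec(M: Matrix, v: Vector) -> Vector:
    """Matrix-vector product M v."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]


def simulate(F: Matrix, G: Matrix, H: Matrix,
             inputs: List[Vector], x0: Vector) -> List[Vector]:
    """Iterate x(t+1) = F x(t) + G u(t), y(t) = H x(t); return y(0), y(1), ..."""
    x = list(x0)
    outputs = []
    for u in inputs:
        outputs.append(mat_vec(H, x))          # y(t) = H x(t)
        Fx, Gu = mat_vec(F, x), mat_vec(G, u)  # the two terms of the next state
        x = [a + b for a, b in zip(Fx, Gu)]
    return outputs


# Hypothetical two-state, one-input, one-output system (illustrative values).
F = [[0.0, 1.0], [-0.5, 1.0]]
G = [[0.0], [1.0]]
H = [[1.0, 0.0]]

# A unit pulse at t = 0 applied to the system at rest.
pulse = [[1.0], [0.0], [0.0], [0.0], [0.0]]
ys = simulate(F, G, H, pulse, x0=[0.0, 0.0])
print(ys)
```

Only the input and output sequences would count as observable here; the state x is internal to the simulation.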

The concept of a "system" goes back to Newtonian mechanics, with the
very important addition of the concepts of "inputs" and "outputs". A good
mental model, especially in discrete time, is a computer. The formulation
of the basic definitions is conceptually valid in complete generality,
without the assumption of linearity (see KALMAN, FALB, and ARBIB, [1969, Chapter
1]). But linearity becomes essential if universal (that is, system-theoretic)
mathematical results are wanted, not just definitions. The power of
mathematics, as currently applied to system theory, stems almost entirely from the
word "linear".

Definitions (2.1-2) formalize the concept of a system in the axiomatic
style. They provide a highly convenient starting point for further
discussions. Sometimes (2.1-2) are called the internal definitions of a system,
alluding to the fact that the definition is stated in terms of the internal
or state variables (the components of the vector x).

To relate the axiomatics to the real world, it is necessary to introduce
the word "behavior". It has about the same meaning in system theory as in
econometrics or other applied fields. "Behavior" means directly observable
properties of a system; for a deterministic system "observation" means,
by definition (!), knowledge of the input and the output; the state is always
to be regarded as nonobservable. (In stochastic system theory, observation
of the inputs is replaced by the postulated knowledge of their probability
distribution. For us here, this distinction is a side issue.)

The  output  of  a  linear  system  must  be  linearly  (causally)  related  to 
its  input.  Using  nothing  more  than  the  mathematical  definition  of  linearity, 
this  implies  that  for  a  discrete-time  system  the  input/output  relation  must 
take  the  form 

(2.3)  $y(t) = \sum_{\tau=0}^{t} A_{t-\tau}\, u(\tau), \qquad A_0 = 0.$

We  may  view  (2.3)  as  the  external  definition  of  a  system.  Thus  "behavior" 
for  a  linear  discrete-time  system  is  quantified  by  the  specification  of  an 
infinite  sequence  of  matrices 

(2.4)  $S = (A_1, A_2, \ldots).$

For continuous-time systems, the definition of S is again the same as
(2.4). (However, (2.3) must be replaced by the convolution integral and
the derivation of (2.4), which we omit here, turns out to involve nontrivial
mathematical technicalities.)

Note  that  (2.4)  represents  deterministic  (nonstochastic)  behavior. 

The  behavior  of  a  system  given  by  (2.1-2)  is  easily  calculated  by  the 
algebraic  formulas 

(2.5)  $A_t := H F^{t-1} G, \quad t = 1, 2, \ldots.$
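
To make formula (2.5) concrete, the sketch below computes the first few matrices $A_t = H F^{t-1} G$ for a small system and checks that the output obtained by iterating (2.2) from the zero state at t = 0 coincides with the weighted sum of past inputs, with weights $A_{t-\tau}$. The system, the input values, and all function names are assumptions introduced only for this illustration; exact rational arithmetic is used so that the comparison is not affected by rounding.

```python
from fractions import Fraction


def mat_mul(A, B):
    """Product of two matrices given as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]


def mat_add(A, B):
    """Entrywise sum of two matrices of equal shape."""
    return [[a + b for a, b in zip(ra, rb)] for ra, rb in zip(A, B)]


def markov_parameters(F, G, H, count):
    """Return [A_1, ..., A_count] with A_t = H F^(t-1) G, as in (2.5)."""
    params, FkG = [], G          # FkG holds F^(t-1) G
    for _ in range(count):
        params.append(mat_mul(H, FkG))
        FkG = mat_mul(F, FkG)
    return params


def simulate(F, G, H, inputs):
    """Outputs y(0), y(1), ... of (2.2), starting from the zero state."""
    x = [[Fraction(0)] for _ in F]
    ys = []
    for u in inputs:
        ys.append(mat_mul(H, x))
        x = mat_add(mat_mul(F, x), mat_mul(G, u))
    return ys


def convolve(A, inputs):
    """y(t) = sum over tau < t of A_(t-tau) u(tau); A[k-1] stores A_k."""
    ys = []
    for t in range(len(inputs)):
        y = [[Fraction(0)] for _ in A[0]]      # one row per output component
        for tau in range(t):
            y = mat_add(y, mat_mul(A[t - tau - 1], inputs[tau]))
        ys.append(y)
    return ys


# Hypothetical system and input sequence (illustrative values only).
F = [[Fraction(0), Fraction(1)], [Fraction(-1, 2), Fraction(1)]]
G = [[Fraction(0)], [Fraction(1)]]
H = [[Fraction(1), Fraction(0)]]
inputs = [[[Fraction(v)]] for v in (1, 2, 0, -1, 3)]

A = markov_parameters(F, G, H, len(inputs))
agree = simulate(F, G, H, inputs) == convolve(A, inputs)
print(agree)
```

The internal description (F, G, H) and the external description by the sequence of $A_t$ give the same input/output pairs in this example, which is all that "behavior" refers to.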

Whether  we  take  the  internal  or  the  external  point  of  view  in  defining 
a  system,  we  must  always  bear  in  mind  that  we  are  dealing  with  an  abstract 
mathematical  object.  This  fact  must  not  be  obscured  by  the  fact  that  to 
describe  either  (F,  G,  H)  or  S  we  make  use  of  numbers  (or,  according 
to common usage, "parameters"), namely the elements of the matrices (F, G,
H) and of the (infinitely many) matrices $A_1, A_2, \ldots$.

The descriptive parameters of a system are not at all the same as its
intrinsic parameters. The reason is that a "system" is really a nonlinear
object and does not admit parameters in the same simple sense as, e.g.,
vectors in the space $R^n$. To (intrinsically) parametrize (or coordinatize)
a family of systems means that each system in the family is given by a
unique set of numbers, which is not at all a requirement for a descriptive
parametrization.

To understand this point better, let us emphasize that the definitions
(2.1-2) are subject to an equivalence relation which arises from regarding
two systems as essentially the same when their (external) behavior is the
same. It is easy to see that a change of coordinates in the state space
$X = R^n$, which is written as $\Sigma \to \hat{\Sigma}$ and defined by the relations

(2.6)  $\begin{cases} \hat{F} T = T F, \\ \hat{G} = T G \qquad (\det T \neq 0), \\ \hat{H} T = H, \end{cases}$

implies $S_{\hat{\Sigma}} = S_{\Sigma}$, that is, preservation of behavior. The converse is also
true, provided we relax the condition $\det T \neq 0$.
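
The invariance of behavior under a change of state coordinates can be checked numerically. The sketch below is a minimal illustration under assumed values: the system (F, G, H), the matrix T, and the helper names are hypothetical, and the transformed system is computed as $(T F T^{-1}, T G, H T^{-1})$, which is one way of satisfying the three relations of the coordinate change when T is nonsingular.

```python
from fractions import Fraction


def mat_mul(A, B):
    """Product of two matrices given as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]


def inverse_2x2(T):
    """Exact inverse of a nonsingular 2 x 2 matrix."""
    (a, b), (c, d) = T
    det = Fraction(a * d - b * c)
    if det == 0:
        raise ValueError("T must be nonsingular (det T != 0)")
    return [[d / det, -b / det], [-c / det, a / det]]


def change_coordinates(F, G, H, T):
    """Return (T F T^-1, T G, H T^-1), the system in the new state coordinates."""
    T_inv = inverse_2x2(T)
    return mat_mul(mat_mul(T, F), T_inv), mat_mul(T, G), mat_mul(H, T_inv)


def markov_parameters(F, G, H, count):
    """Return [A_1, ..., A_count] with A_t = H F^(t-1) G."""
    params, FkG = [], G
    for _ in range(count):
        params.append(mat_mul(H, FkG))
        FkG = mat_mul(F, FkG)
    return params


# Hypothetical system and coordinate change (illustrative values only).
F = [[Fraction(0), Fraction(1)], [Fraction(-1, 2), Fraction(1)]]
G = [[Fraction(0)], [Fraction(1)]]
H = [[Fraction(1), Fraction(0)]]
T = [[1, 2], [3, 5]]             # det T = -1

F_hat, G_hat, H_hat = change_coordinates(F, G, H, T)

same_behavior = (markov_parameters(F, G, H, 8)
                 == markov_parameters(F_hat, G_hat, H_hat, 8))
different_numbers = (F_hat, G_hat, H_hat) != (F, G, H)
print(same_behavior, different_numbers)
```

Two different sets of descriptive numbers thus produce the same behavior sequence, which is why the entries of (F, G, H) cannot serve as intrinsic parameters.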

To (intrinsically) parametrize a family of systems $\Sigma$ subject to this
equivalence relation is a nontrivial problem. Unless this is done, $\Sigma$
cannot be identified from S because the identification problem, by definition,
relates only to the behavior of a system, and not to its axiomatic definition.

From here on, model will be used as a technical term for the equivalence
class $[\Sigma]$ of $\Sigma$ under the equivalence relation (2.6). With this
terminology, the descriptive parameters relate to $\Sigma$ and the intrinsic parameters
relate to $[\Sigma]$. There is endemic confusion in econometrics between these
two sharply different concepts, as will be discussed below (see, e.g.,
Section 7 (6)).


A similar distinction must be made also with regard to S. This is a
rather subtle theoretical point. The intrinsic parameters of S arise from
imposing certain restrictions on the data, such as the requirement that S
admits a finite-dimensional realization. This will be discussed below and in
Section 6.

The problem of model building, in the deterministic case, is to find a
system $\Sigma$ whose behavior is the same as the observed behavior $S_0 = S_{\Sigma_0}$
of some (internally) unknown system $\Sigma_0$. If $\Sigma$ is such a system, that is,
if $S_{\Sigma} = S_0$, then we call $\Sigma$ a realization of $S_0$. The mathematical problem
is one of finding (F, G, H) given all the $A_t$ in (2.5).

Evidently, the model $[\Sigma_{S_0}]$ serves as a kind of substitute for $\Sigma_0$.
If we do not assume (and have reasons for assuming) that there is some
(unknown) $\Sigma_0$ responsible for the generation of the (known) $S_0$, the whole
modeling exercise becomes scientifically meaningless.


The abstract properties of a realization as just defined are sufficiently
rich to allow important mathematical results to be obtained. The main fact
is that for canonical realizations $\Sigma^{\mathrm{can}}$, which always exist if any
realization exists at all, there is a bijection (= one-to-one correspondence)

(2.7)  $S \;\overset{1\text{-}1}{\longleftrightarrow}\; [\Sigma_S^{\mathrm{can}}]$

between the data S and the model $[\Sigma_S^{\mathrm{can}}]$.

O 

Phrased differently, this is the classical result (1962) of deterministic
realization theory that the data uniquely determines the model. For this
reason, $[\Sigma_S^{\mathrm{can}}]$ may be regarded as a very reasonable substitute for the
(unknown) system $\Sigma$ which generated the data S.


Thus  we  may  always  view  the  problem  of  model  building  as  a  deductive 
procedure  described  as 

(2.8)  data $\Longrightarrow$ model,


where  the  mathematical  specification  of  the  arrow,  and  indeed  the  solution 
of  the  problem,  is  equivalent  to  the  computation  of  the  bijection  claimed  in 
Theorem  (2.7). 
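
For the simplest possible case, the passage from data to model can be written out explicitly. The sketch below treats scalar data explained by a one-dimensional system; the normalization H = 1, the numerical values, and the function names are assumptions made for this illustration only, not a procedure taken from the text. It shows two different descriptive triples (f, g, h) producing the same data, and a single normalized triple being recovered from those data.

```python
from fractions import Fraction


def behavior(f, g, h, count):
    """Scalar version of A_t = H F^(t-1) G: return [A_1, ..., A_count]."""
    return [h * f ** (t - 1) * g for t in range(1, count + 1)]


def realize_first_order(data):
    """Return (F, G, H) with H = 1 that reproduces `data`.

    Raises ValueError if no one-dimensional system explains the data
    (the case A_1 = 0 is not handled by this sketch)."""
    a1, a2 = Fraction(data[0]), Fraction(data[1])
    if a1 == 0:
        raise ValueError("A_1 = 0 is not handled by this sketch")
    f = a2 / a1
    model = (f, a1, Fraction(1))
    if behavior(*model, len(data)) != [Fraction(a) for a in data]:
        raise ValueError("data are not explained by a one-dimensional system")
    return model


# Two hypothetical descriptive parametrizations with the same behavior.
system_a = (Fraction(1, 2), Fraction(2), Fraction(3))
system_b = (Fraction(1, 2), Fraction(6), Fraction(1))
data_a = behavior(*system_a, 6)
data_b = behavior(*system_b, 6)

model = realize_first_order(data_a)
print(data_a == data_b, model)
```

In this toy case the data determine two numbers (here F and the product of G and H), while the descriptive triple has three; the third is fixed only by the chosen normalization.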

The historical development of time-series analysis as well as of the
related current econometric lore puts the cart before the horse. It attempts
to determine the numbers specifying $\Sigma$ from the numbers specifying S
before the intrinsic (nonparametric) issues are understood.

By Theorem (2.7), the practical identification problem reduces to the
determination of the intrinsic parameters of the model. Since there is a
bijective correspondence between data and model, it is clear that the
intrinsic parameters of the data (which is what we really mean by "data") must
correspond bijectively to the parameters of the model. So the mathematical
problem of intrinsic parametrization may be restated also as

(2.9)  Parametrize data and model so that the bijective correspondence
between the two is preserved.

The question of parameter identification doesn't even arise. Realization
theory provides the rule for attaching, uniquely, the (a priori unknown)
intrinsic parameters of $[\Sigma_S^{\mathrm{can}}]$ to the (a priori known) intrinsic
parameters of S. Every parameter of $[\Sigma]$ is "identifiable" because the
correspondence (2.7) is bijective.

The (intrinsic) parametrization of $[\Sigma^{\mathrm{can}}]$ is related to the old
mathematical subject of canonical forms which is experiencing a renaissance
under the impetus of system theory (see TANNENBAUM [1981]). As far as
identification is concerned, the basic result is (2.7), which automatically
proves that the "identifiable parameters" of a system based on the behavior
data S are simply the intrinsic parameters of $[\Sigma_S^{\mathrm{can}}]$. If we start from
a given system $\Sigma_0$, then its intrinsic parameters are those of
$[\Sigma^{\mathrm{can}}_{S_{\Sigma_0}}]$.

0  Lo 

In the applied context, the elements of the matrices $A_1, A_2, \ldots$ are
often called "behavioral parameters". This is an acceptably plastic
terminology if we remember again that these are descriptive parameters, which
have nothing to do with identification or identifiability. They are not
the same as the intrinsic parameters. In fact, it is incorrect to assume
that the elements of the sequence $A_1, A_2, \ldots$ are arbitrary numbers because
this may be in contradiction to the problem statement that S is to be
realized by some (further specified or restricted) family of systems $\Sigma$.

The main object of the applied mathematical part of realization theory
is the determination of the explicit numerical form of the bijective
correspondence (2.7). Once this is known, we have in principle a computer program
for the identification of $[\Sigma]$ from S.

Precisely because the correspondence (2.7) is bijective, every system
property stated in terms of $\Sigma$ must have a unique counterpart as a data
property expressed in terms of S. The most important question of this
sort is: how can we express the finiteness of $\Sigma$ in terms of S? The
answer to this question implicitly defines the intrinsic parameters of S.

If we define dim $\Sigma$ := size of the square matrix $F_{\Sigma}$, then the result
is

(2.10)  $\dim \Sigma_S^{\mathrm{can}} = \operatorname{rank} B_S = \operatorname{rank}\,(A_{i+j-1})_{i,j = 1, 2, \ldots}$

The condition imposed on $B_S$ (sometimes called the behavior or Hankel
matrix associated with the data S) shows that the intrinsic parameters
of S are not free (unlike the descriptive parameters) but must satisfy
the condition rank $B_S$ = n. The value of n is not known a priori (except
that it must be finite) and is to be determined from the data S. Theorem
(2.10) shows that this can be done, in principle, by computing the rank
of the infinite behavior matrix $B_S$.
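
The rank computation can be illustrated on finite blocks of the behavior (Hankel) matrix, whose (i, j) entry is $A_{i+j-1}$. The sketch below is a hedged illustration for scalar data: the sequence, the block sizes, and the function names are hypothetical. Note that a finite block only gives a lower bound on the rank of the infinite matrix; the fact that the rank stops growing is suggestive, not a proof, unless a bound on n is assumed.

```python
from fractions import Fraction


def hankel_block(seq, size):
    """size x size block with entry (i, j) equal to A_(i+j-1), for scalar A_t.

    seq[k-1] stores A_k, so the block needs 2*size - 1 terms."""
    return [[Fraction(seq[i + j]) for j in range(size)] for i in range(size)]


def rank(M):
    """Rank by exact Gaussian elimination over the rationals."""
    M = [row[:] for row in M]
    r = 0
    for c in range(len(M[0])):
        pivot = next((i for i in range(r, len(M)) if M[i][c] != 0), None)
        if pivot is None:
            continue
        M[r], M[pivot] = M[pivot], M[r]
        for i in range(r + 1, len(M)):
            factor = M[i][c] / M[r][c]
            M[i] = [a - factor * b for a, b in zip(M[i], M[r])]
        r += 1
        if r == len(M):
            break
    return r


# Hypothetical scalar behavior sequence generated by the two-term recursion
# A_(t+2) = A_(t+1) - A_t / 2 (illustrative values only).
seq = [Fraction(0), Fraction(1)]
while len(seq) < 7:
    seq.append(seq[-1] - seq[-2] / 2)

ranks = [rank(hankel_block(seq, k)) for k in range(1, 5)]
print(ranks)
```

For this sequence the rank of the blocks settles at 2, which is consistent with the data having been generated by a two-dimensional system.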

O 

The preceding discussion outlines the questions that must be understood
in order for the modeling problem to be well-defined. This is unfortunately
not the case in a large part of the econometrics literature, as we shall
show in some detail in Sections 3-5 below.

In contrast to the deterministic case, the stochastic realization problem
has not yet given rise to a definitive theory. The reasons for this lag may
well lie in the preceding remark. In any case, as an organizing principle
for wading through the existing conceptual mess, we adopt the nondebatable
criterion

(2.11)  Any stochastic identification procedure must be effective also
whenever the noise effects are arbitrarily small.

Accordingly, it makes good theoretical sense to dissect the literature
concerned with (linear) stochastic realization with respect to its treatment
of the (linear) deterministic model which underlies any stochastic model.

Here our treatment of the stochastic aspects will be necessarily rather
sketchy. This, however, is not as much of a limitation as it might seem.
The classical guiding principle of the stochastic analysis of linear systems
is that all random inputs must be reduced, by suitable dynamical modeling,
to white noise. (This principle is one of the main factors responsible
for the success of Kalman filtering, as discussed in KALMAN [1960], [1978].)
Thus deterministic dynamical modeling is almost always the main task
confronting also stochastic realization theory; the analysis of the effects of white
noise is straightforward and subsidiary. Thus

(2.12)  The first basic problem in (linear) time-series modeling is the
precise and proper specification of the underlying (linear)
deterministic dynamical system.

3.  IDENTIFIABILITY: FIRST EXAMPLE

To give concrete form to the preceding conceptual discussion, we may
take, at random, some published material from the econometric literature
dealing with identification and subject it to system-theoretic scrutiny.

Let us choose, for example, an expository article on identification by
SCHÖNFELD [1979]. To facilitate referencing statements made in that article
we reproduce here, in somewhat paraphrased English translation, the contents
of Section 1 of that article. Italics are those of the original.

"1. Introduction.

"1.1 Intuitive background. The following discussion is restricted
to the identification of characteristics [parameters] in stochastic
models. In general terms, a characteristic is identifiable
provided it can be uniquely inferred from the probability distributions
of the observed random variables. In econometrics identifiability
is to be regarded primarily as a necessary condition for the
estimation of the parameters one is interested in.

"Let us now illustrate the intuitive basis of these definitions
by an example."

At this point there is no possible objection to SCHÖNFELD's argumentation
since it takes place on the level of intuitive "definitions". The difficulty
immediately arises, however, that the notions of a model and observed random
variables cannot be understood with sufficient precision on this (intuitive)
level. This is seen by proceeding to dissect the example given by SCHÖNFELD:

"1.2. Example. Consider the model with the equation

(a)  $y_t = \alpha y_{t-1} + u_t, \quad t \in T = \{\ldots, -1, 0, 1, \ldots\},$

where $u = \{u_t\}$ is a linear stochastic process whose values

(b)  $u_t = \sum_{\tau=0}^{\infty} \kappa_\tau \varepsilon_{t-\tau}$

are generated with the aid of white noise $\varepsilon = \{\varepsilon_t\}$ defined by
the assumptions

(c)  $\begin{cases} \ldots \\ E(\varepsilon_t \varepsilon_{t'}) = 0 \text{ for } t \neq t', \quad t, t' \in T, \end{cases}$

where the nonstochastic parameters $\kappa = \{\kappa_\tau\}$ satisfy the
conditions

(d)  $|\alpha| < 1,$

(e)  $\kappa_0 = 1,$

(f)  $\sum_{\tau \geq 1} \kappa_\tau^2 < \infty.$

"Only the (wide-sense) stationary process $y = \{y_t\}$ is
assumed to be observable.

"Conclusions:

(A) Under the general specifications of the model as given
above, the parameter $\alpha$ is not identifiable. Reason: for each
admissible "structure" $s' = (\alpha', \kappa', \varepsilon)$ and each $\alpha''$ there is
a $\kappa'' = \{\kappa''_\tau\}$ such that $y'$ and $y''$ from $s'$ and
$s'' = (\alpha'', \kappa'', \varepsilon)$ are identical. The observed process y does
not permit differentiating between various (assumed) values of
$\alpha'$ and $\alpha''$.

(B) If we "sharpen" the model in such a way that we "allow"
only processes of moving average (MA) type to generate u,
with the order of the MA process being no greater than L, in
other words, if we define

(g)  $u_t = \sum_{\tau=0}^{L} \kappa_\tau \varepsilon_{t-\tau},$

then $\alpha$ is identifiable, with the exception of the "structures"
for which $Q_\kappa(\alpha) = 0$, where

(h)  $Q_\kappa(z) = \sum_{\tau=0}^{L} \kappa_\tau z^{L-\tau}.$

(C) As a special case of (B), $\alpha$ will be identifiable if u
is white noise (L = 0)."

"1.3. Remarks. (A) The identification problem occurs already
with single equations.

(B) The same problem arises also with the parameters of a
"reduced form".

(C) A parameter which is not identifiable in a model may
very well become identifiable in a "sharpening" of this model.
Identifiability depends crucially on the modeling assumptions,
which therefore should always be given completely."

The  analysis  of  the  preceding  assertions  by  SCHONFELD,  which  are  typical 
of  similar  statements  found  in  the  literature,  requires  a  large  number  of 
remarks. 


(1) Note that "model" is not precisely defined. SCHONFELD implies
that equation (a) is the "model". This is not enough; in addition to the
state-transition equation given by (a), it is necessary to have a definition
of input and output. The input should be defined as $u_t$, given by (b-c),
and the output as $y_t$. Here, accidentally, "state" and "output" are the
same.

(2) The conditions imposed on $u_t$ force the input to be the stochastic
part of the problem. What is intended, evidently, is to define a deterministic
(nonstochastic) model, namely equation (a) plus input plus output,
which is to be identified from the probability distribution of $y_t$ given
only a-priori postulated stochastic properties (no observation) of the
input.

Specifically, it is assumed that the input is generated by white noise
acting as input on a linear, stable dynamical system (which is what the
author means by the nonstandard term of linear process).

(3) For the deterministic identification problem involving (a) to make
sense, the input will have to be known. Then it would follow (but see
below) that $\alpha$ is "identifiable". In formulating the problem in this way
the basic question would be: Is the observed data $y_t$ explainable by a
one-dimensional model? In SCHONFELD's example this question is circumvented
by the brute-force assumption that the system is, in fact, one-dimensional.
This imposes a very strong prejudice on the data $y_t$.

(4) On the other hand, considering a random sequence $u_t$ and postulating
that it is generated by a linear system (b) with a (scalar) white
noise input $\varepsilon_t$ is a very weak assumption. SCHONFELD alludes to this fact,
rather imprecisely, by talking about weak stationarity. Roughly speaking,
any weakly stationary process may be modeled in the manner described by (b)
and (c). Thus weak stationarity is really the main assumption and not
formula (b).

(5) Putting together the previous two remarks about $y_t$ and $u_t$, we
see that the example consists of the combination of a strong assumption
(one-dimensionality) about the "model" for $y_t$ with a weak assumption
(weak stationarity) about the "model" for $u_t$. This is, of course, intuitive
nonsense.

(6) In precise terms, the specification of the example amounts to saying
that (i) $y_t$ is a weakly stationary process and (ii) the linear system
generating $y_t$ has the special property that it admits in its transfer
function the factor $1/(z - \alpha)$. In other words, the hypothesis is that the
system has a pole and the problem is to locate this pole by determining $\alpha$.

(7) Now let us suppose that $y_t$ (and hence, a fortiori, $u_t$) is
generated by a finite-dimensional linear system $\Sigma$ subject to white-noise
input. In this setting, the problem posed by SCHONFELD is nonsense. Of
course, every such finite-dimensional system $\Sigma$ has a factor $1/(z - \alpha)$
in its transfer function. Every such factor is identifiable if and only if
$\Sigma$, which is responsible for generating $y_t$ from white noise, is
identifiable. However, unless $\Sigma$ = one-dimensional (which would be equivalent
to SCHONFELD's very strong assumption (C)), the problem of "identifying $\alpha$"
is not well defined because the transfer function of $\Sigma$ will have many $\alpha$'s
and it is not clear which $\alpha$ is to be identified.

"Not well defined" necessarily implies "not identifiable". The real
trouble is that the natural problem underlying the one circumscribed by
SCHONFELD is not the identification of $\alpha$ but the identification of (the
transfer function of) $\Sigma$, which is something entirely different.


(8) Next, taking the case complementary to (7), let us suppose that
$y_t$ is generated by an infinite-dimensional system. Then it is (at least
in the elementary theory, under the assumptions stated by SCHONFELD) not
clear, for reasons of mathematical rigor, exactly what is to be meant by
a pole of a transfer function. (For example, the transfer function $e^{-z}$
has no pole.) But the problem calls for locating the pole. Again the
problem is not well defined.


(9) The obvious system-theoretic objection to SCHONFELD's example is
that he arbitrarily singles out, by an ad-hoc assumption (forcing $u_t$ to
be nonobservable), a nonintrinsic property of the linear system $\Sigma$
generating $y_t$. As we have seen in Section 2, the confusion arises from two
common interpretations of the word "parameter", namely,

(i) descriptive parameters, and
(ii) intrinsic parameters.

SCHONFELD uses "parameter" in the first sense when he writes down
equation (a). When he talks about "identifying $\alpha$", SCHONFELD reverts
to the second sense. Evidently $\alpha$ is not an intrinsic parameter of $\Sigma$
and so we cannot talk about "identifying" it.

In other words, since equation (a) is not a proper way of specifying
the model relevant to the example, the parameter $\alpha$ that went into (a)
cannot be recovered from the data constituted by the probability
distribution of $y_t$. Only intrinsic system properties can be expected to be
"identifiable", not things like the choice of a coordinate system, choice
of units of measurement, etc.


Clearly  the  problem  is  not  well-defined. 

(10) When SCHONFELD speaks, under 1.1, of the "estimation of parameters
one is interested in" he makes it quite clear that he regards a "parameter"
as an absolute attribute of a system. Apparently he takes it for granted
that there exist such "absolute" parameters.

Unfortunately, this is merely wishful thinking and not a theorem. In
general, mathematical objects, such as a linear system, cannot be
parametrized in such a way that parameters have absolute significance like, for
example, mass in physics. Mass possesses this desirable attribute because
it is directly observable and context-independent. Economic quantities
(like inflation, rate of savings, productivity, etc., etc.) have no such
absolutely measurable attributes but are very much context-dependent and
interrelated with many other economic variables. For example, inflation
does not mean the same thing in a classical economy as in a socialist one
or in the hypothetical one where all wages, taxes, savings, etc. are
perfectly indexed.

Thus what we may hope to identify in an interrelated situation is a
model but not a specific arbitrarily given system of coordinates (intrinsic
parameters) for that model.

(11) There is a further element of fuzziness in SCHONFELD's description
of the problem. He does not specify with mathematical precision what is
to be regarded as data for the identification problem. (He mentions under
1.1 that identification is to be based on the "probability distributions of
the observed random variables". This is not enough if we want to do
calculations; for these, further information or assumptions are needed about the
structure of the underlying probability distributions.)

The conventional formulation is as follows. We take $y_t$ (or $u_t$) as
a gaussian (or, equivalently, from the theoretical point of view,
second-order) random sequence. Then the problem data consists of the knowledge
of the covariances

(3.1a) $\operatorname{cov}(y_t, y_{t+r})$, $r = 0, 1, 2, \ldots$,

in addition to the relatively harmless normalization assumption

(3.1b) $E(y_t) = 0$.

With these specifications, we finally have a well-defined problem. It
is to determine the equivalence class $[\Sigma]$ of all linear systems $\Sigma$ which
may have generated the data (3.1). An element $\Sigma$ of $[\Sigma]$ is then
called a realization of (3.1) and $[\Sigma]$ is the model or models we wish to
identify.

In practice, we pick out a "typical element" $\Sigma_*$ of $[\Sigma]$ and identify
that. If the realization problem fails to have a unique solution, then,
unfortunately, the class $[\Sigma]$ will contain more than one essentially
different model; to classify the family of essentially different elements of
$[\Sigma]$ we will need a new kind of "parameters" which, by definition, cannot
be obtained from the data (3.1).

(12) What we have just described is an example of the stochastic
realization problem in system theory. This problem has an enormous literature
(see, e.g., KALMAN [1965], RISSANEN and KAILATH [1972], FAURRE [1975],
AKAIKE [1974], PICCI [1976, 1977], FAURRE et al. [1977], VAN PUTTEN and
VAN SCHUPPEN [1979]). The problem is not yet completely settled today in
that the precise determination of the equivalence class of all realizations
of stochastic data like (3.1) is mathematically nontrivial.

(13) Thus we have arrived at a reformulation of the problem which differs
substantially from the point of view taken by SCHONFELD.

In accordance with the prescription

data $\rightarrow$ model,

the problem is to determine first (usually only abstractly) the class $[\Sigma]$
of all models $\Sigma$ possessing the behavior fixed by the data (3.1). For
example, the problem may be posed in such a way that $\Sigma$ must be a linear
model, which is the counterpart of the assumption that the data is of the
form (3.1).

Having determined the class $[\Sigma]$, there are two remaining theoretical
questions:

Uniqueness. Does $[\Sigma]$ have more than one essentially different element?
(The next section offers an example of this.)

Parametrization. Parametrize the family of all possible data (3.1),
thereby automatically parametrizing the family of all equivalence classes
$[\Sigma]$. This is usually a deep mathematical problem in the general realm of
system theory which in many cases, especially those of interest to
econometrics, is presently open. (Of course, parametrization, in the
mathematical sense, is always what we have called intrinsic parametrization.)

Thus the question of "identifiability of parameters" does not arise at
all. "Parametrizing the data" means that the data parameters (which
correspond to the second type under (9)) are identifiable by definition. The
family of all $[\Sigma]$ models is then automatically parametrized because its
elements correspond bijectively to the problem data.

(14) The preceding may become clearer if we now make the obvious
remark that a linear system (like (a)) driven by another linear system
(like (b)) subject to a white noise input is still simply a linear system
subject to a white noise input. If $\Sigma_1$ is this linear system, it may be
factored (cascaded) as

(3.2) $\Sigma_1 = \Sigma_2 \Sigma_3$

with $\Sigma_2$ generating $y_t$ from an unobserved stochastic input $u_t$ and
$\Sigma_3$ generating $u_t$ from an assumed (but again unobserved) white noise
sequence $\varepsilon_t$. Such a factorization is not an intrinsic property of the
system $\Sigma_1$. It can be performed in many ways. There is no reason for
doing it so that $\Sigma_2$ = 1-dimensional (the SCHONFELD assumption). Since
the factorization is necessarily commutative, no useful statements can be
made about the dimension of either factor except that
$\dim \Sigma_1 = \dim \Sigma_2 + \dim \Sigma_3$. So we see once again that the SCHONFELD problem
is not well defined.
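The nonuniqueness of such a factorization, and with it conclusion (A) of the quoted example, can be checked numerically. The following Python sketch is an illustration only and not part of the original argument: the function names `impulse_response` and `equivalent_kappa` and the truncation horizon `n` are assumptions introduced here. It computes the first `n` impulse-response coefficients of the overall system from white noise to $y_t$ and shows that a different value of $\alpha$ reproduces them once $\kappa$ is adjusted accordingly.

```python
def impulse_response(alpha, kappa, n):
    """First n impulse-response coefficients from white noise to y, for
    y_t = alpha*y_{t-1} + u_t with u_t = sum_k kappa[k]*eps_{t-k}.
    (Hypothetical helper; kappa is treated as zero beyond its length.)"""
    h = []
    for t in range(n):
        u = kappa[t] if t < len(kappa) else 0.0
        prev = h[t - 1] if t > 0 else 0.0
        h.append(alpha * prev + u)
    return h


def equivalent_kappa(h, alpha_new):
    """Weights kappa'' such that (alpha_new, kappa'') reproduces the same
    impulse response h: kappa''_t = h_t - alpha_new*h_{t-1}."""
    return [h[t] - (alpha_new * h[t - 1] if t > 0 else 0.0)
            for t in range(len(h))]


# Two different "structures" with the same input-output behavior
# (checked only over the first n coefficients, an assumption of this sketch).
n = 20
h_first = impulse_response(0.5, [1.0, 0.3], n)
kappa_second = equivalent_kappa(h_first, -0.2)
h_second = impulse_response(-0.2, kappa_second, n)
max_gap = max(abs(x - y) for x, y in zip(h_first, h_second))
print("largest difference between the two impulse responses:", max_gap)
```

Since the second-order data (3.1) depend only on the overall map from white noise to $y_t$, equal impulse responses mean that the two pairs $(\alpha, \kappa)$ cannot be told apart from the data; the normalization $\kappa_0 = 1$ is preserved by the construction.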

(15) When SCHONFELD goes on to talk in Section (B) about "sharpening"
the model, the system theorist objects. To say that $u_t$ is generated by
a moving average process is an assumption completely out of the blue,
unless it is known (for some special extraneous reason) that this is so,
in which case the definition of the model must be modified to give a new
"parameter set" $(\alpha, \Theta(z))$. See (h).

(16) To be well-defined, this new parameter set must be subjected to
the restriction that $\Theta(z)/(z - \alpha)$ has no common factor (i.e., that
$\Theta(\alpha) \neq 0$). The "exceptional case" $\Theta(\alpha) = 0$, which SCHONFELD interprets
as destroying the identifiability of $\alpha$, is not a case at all as it must
be ruled out in advance for the problem to be well defined. So $\Theta(\alpha) \neq 0$
has nothing to do with identifiability.

(17) If $((\alpha, \Theta(z)), \Theta(\alpha) \neq 0)$ is taken as the new parameter set,
corresponding restrictions must be imposed also on the probability
distributions of the observed random variables. This means that the data (3.1)
must now be restricted by the condition that it is generated from white
noise via the model with transfer function $\Theta(z)/(z - \alpha)$, cancellation
not allowed. It is a serious shortcoming of SCHONFELD's discussion that
he overlooks this point.

(18) SCHONFELD's further "sharpening" of the model, namely the
postulate that $u_t$ is a white-noise sequence, leads to the correct claim
that $\alpha$ is identifiable, because then $\Sigma$, the linear system which generates
$y_t$ from white noise, is one-dimensional and questions of factorization and
parametrization are trivial. Far from being a "sharpening" of the model,
this is in fact a very strongly prejudiced assumption. To fix the dimension
of $\Sigma$ to be 1 irrespective of the data does violence to the problem of
identification since, as we have argued before, $\dim \Sigma$ is information
that must be deduced from the data and not imposed beforehand.

(19) SCHONFELD's explanatory remarks under (1.5) are now irrelevant.
In particular, it is not true that "a parameter which is not identifiable
in a model may well become identifiable in a sharpening of this model".
What has happened is that $\alpha$ was not well-defined initially as an
intrinsic parameter because the model was not well-defined; if after imposing
very strong restrictions the model becomes well-defined it might well also
happen that the parameter becomes intrinsic. This is not an illustration
of identifiability but a symptom of a bad theory.

We may roughly summarize these critical comments by noting the following
basic facts of life in modeling and identification:

(i) The first theoretical task in any identification problem is to
make sure the model (system to be identified) is well defined.

(ii) Given (i), it is possible to compute (in principle) the
equivalence class $[\Sigma]$ of all systems $\Sigma$ which generate the same (external)
data.

(iii) By Theorem (2.7) data parametrization induces a parametrization
of the family of all equivalence classes $[\Sigma]$. By definition, we are
dealing here with intrinsic parameters and the question of "parameter
identifiability" is empty.

(iv) If the realization problem has a unique solution, we are finished.

(v) If the identification problem has a nonunique solution, then
further parameters may have to be introduced to describe all the models
in $[\Sigma]$ which are essentially different from each other; such parameters
are of course never identifiable.

To put the issue even more crudely and briefly, parameter identifiability
is not a viable scientific concept. The real problems concern the
uniqueness of realization and applied mathematical techniques for computing
realization.

We do not wish to create the impression that SCHONFELD's article was
quoted out of context. Very similar remarks apply, for example, to the
introductory discussion of HANNAN [1971]. The examples given by him
illustrate situations where the model is not well defined. The source of
confusion is again the failure to make distinctions between descriptive
and intrinsic parameters. Further examples may be found in KALMAN [1981].

4. IDENTIFIABILITY: SECOND EXAMPLE*

To illustrate further the notion of a stochastic realization we shall
now look at an example taken from the classical paper of KOOPMANS† and
REIERSØL [1950]. It is concerned with the identification of a static
relationship and in that sense it is trivial from the point of view of
(dynamical) system theory.

* This section was not part of the oral presentation.

† I am indebted to Professor Koopmans for having pointed out to me this
and other related papers, nearly fifteen years ago already.

Consider a linear (more precisely, affine) relationship between a scalar
input $u$ and a scalar output $y$ given (necessarily) by two unknown
parameters $\alpha$, $\beta$:

(4.1) $y = \alpha + \beta u$.

The information obtained about this relationship comes from two noisy
observations, which we shall write (in accordance with the classical
notations of "Kalman filtering theory" in KALMAN [1960]) as

(4.2) $z_1 = u + v_1$, $z_2 = y + v_2$,

with $v_1$, $v_2$ random variables subject to the assumption

(4.3) $E(v_i) = 0$, $i = 1, 2$.

The model for this problem is given by (4.1-2) and by the preceding
specification of input, output, and observations. (In general, and also
here, for stochastic systems the output is not necessarily the same as
the observables.) The stochastic environment is specified by (4.3) and
by some assumption concerning the probabilistic relationships between
$u$, $v_1$, and $v_2$. With KOOPMANS and REIERSØL we assume that $u$ is gaussian
and independent of $v_1$ and $v_2$.

Having set the stage, the problem is now: Are $\alpha$, $\beta$ identifiable or
not identifiable (under various specific assumptions concerning the joint
distribution of $(v_1, v_2)$, in addition to those just stated above)?

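Case (A). The noise $(v_1, v_2)$ is jointly gaussian. Under the
assumptions stated, we find that

(4.4) $E(z) = \begin{bmatrix} E(u) \\ \alpha + \beta E(u) \end{bmatrix}$,

(4.5) $\operatorname{cov} z = \begin{bmatrix} 1 \\ \beta \end{bmatrix} \begin{bmatrix} 1 & \beta \end{bmatrix} \operatorname{var} u + \operatorname{cov} v$.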

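Recalling also the zero-mean assumptions, it follows that the stochastic
environment is specified by five unknown parameters, namely

(4.6) $E(u)$, $a = \operatorname{var} u$, $b = \operatorname{var}(v_1)$, $c = \operatorname{cov}(v_1, v_2)$, $d = \operatorname{var}(v_2)$,

subject to the restrictions

(4.7) $a > 0$, $\operatorname{cov} v = \begin{bmatrix} b & c \\ c & d \end{bmatrix} \geq 0$.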

Under these hypotheses, the probability distribution of the vector $z$
is gaussian and is specified by $E(z)$ and $\operatorname{cov} z$. Making the conventional
assumption that $E(z)$ and $\operatorname{cov} z$ may be determined or estimated (since
the vector $z$ is observable), KOOPMANS and REIERSØL conclude that the pair
$(\alpha, \beta)$ is not identifiable on the basis of the data $E(z)$ and $\operatorname{cov} z$.

Their reasoning is as follows. Assuming $\operatorname{cov} z$ is nonnegative definite
(as it must be for a covariance), there are in general many parameters $\beta$
which satisfy the obvious constraint

'I  COV 

For each such $\beta$ the condition (4.5) can be met by suitable choice of the
environmental parameters $a, \ldots, d$ satisfying the constraints (4.7).

For each $\beta$ equation (4.4) defines a unique value of $\alpha$.

Even allowing for the embryonic state of system theory in 1950, I would
doubt if any system theorist would have accepted this conclusion of KOOPMANS
and REIERSØL in the manner in which it was first stated. The difficulty
revolves around the proper definition of the model and of its stochastic
environment.

First, let us assume that $c = \operatorname{cov}(v_1, v_2) = 0$, in other words, that
$v_1$ and $v_2$ are independent. Then the identification equations become

(4.9)
$\operatorname{cov}(z_1, z_2) = \beta a$,
$\operatorname{var}(z_1) = a + b$,
$\operatorname{var}(z_2) = \beta^2 a + d$,
$E(z_1) = E(u)$,
$E(z_2) = E(y) = \alpha + \beta E(u)$.

The same setup is used in MEHRA [1976, p. 192-193].

A solution $(\beta, a)$ of the first three equations, if it exists, must
satisfy the conditions

(4.10a) $\beta a = \operatorname{cov}(z_1, z_2)$,

(4.10b) $\operatorname{var} z_1 \geq a$,

(4.10c) $\dfrac{\operatorname{var} z_2}{|\operatorname{cov}(z_1, z_2)|} \geq |\beta|$.

By $\operatorname{cov} z \geq 0$ a solution satisfying these relations always exists; in fact,
the set of such $(\beta, a)$ is just the segment of the hyperbola (4.10a)
delimited by the inequalities (4.10b) and (4.10c). (In other words, we have
here an elementary algebraic-geometric problem calling for locating a certain
segment of a curve given by algebraic equations and inequalities; the
existence of the solution coincides with the probabilistic requirement that
$\operatorname{cov} z \geq 0$.) Given any such admissible pair $(\beta, a)$, the values of $b$ and
$d$ are determined from (4.5) and $\alpha$ is determined from (4.4). Obviously
the solution is not unique.
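The segment of admissible solutions can be traced numerically. The Python sketch below is an illustration under the assumptions of this paragraph ($c = 0$ and $\operatorname{cov}(z_1, z_2) \neq 0$); the function name `admissible_segment` and its arguments are hypothetical and were chosen here. For given second moments of $z$ it samples $\beta$ between the bounds implied by (4.10b) and (4.10c) and recovers $a$, $b$, $d$ from (4.9).

```python
def admissible_segment(var_z1, var_z2, cov_z12, n=5):
    """Sample n >= 2 admissible solutions (beta, a, b, d) of the first three
    equations in (4.9), i.e. points on the hyperbola beta*a = cov(z1, z2)
    with b >= 0 and d >= 0. Assumes cov(v1, v2) = 0 and cov_z12 != 0."""
    if n < 2:
        raise ValueError("need at least two sample points")
    if cov_z12 == 0:
        raise ValueError("this sketch assumes cov(z1, z2) != 0")
    if cov_z12 ** 2 > var_z1 * var_z2:
        raise ValueError("cov z is not nonnegative definite")
    sign = 1.0 if cov_z12 > 0 else -1.0
    lo = abs(cov_z12) / var_z1   # from var z1 >= a, see (4.10b)
    hi = var_z2 / abs(cov_z12)   # from (4.10c)
    points = []
    for k in range(n):
        beta = sign * (lo + (hi - lo) * k / (n - 1))
        a = cov_z12 / beta             # (4.10a)
        b = var_z1 - a                 # var z1 = a + b
        d = var_z2 - beta * cov_z12    # var z2 = beta^2 * a + d
        points.append((beta, a, b, d))
    return points


for beta, a, b, d in admissible_segment(2.0, 3.0, 1.0):
    print(f"beta={beta:.3f}  a={a:.3f}  b={b:.3f}  d={d:.3f}")
```

Every sampled point reproduces the same $\operatorname{cov} z$, which is the nonuniqueness asserted in the text; the endpoints of the segment correspond to $b = 0$ and $d = 0$ respectively.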

Second, the assumption that $\operatorname{cov}(v_1, v_2) = 0$ is not a luxury or a loss
of generality but indispensable for the correct discussion of this problem.
System-theoretically the problem is to estimate the (affine) effect of a
variable $u$ on another variable $y$ when $u$ and $y$ can be observed only
in a noisy way as $z_1$ and $z_2$. Recall that the only assumption (knowledge)
about $v_1$ and $v_2$ is (4.3) and that they are jointly gaussian. If there
were (nonzero) correlation between the values of $v_1$ and $v_2$ this would
imply that part of the cause-and-effect relationship between $z_1$ and $z_2$
is to be explained by the model (4.1) whereas another part, of unknown
amount, is to be explained by the correlation between the noise components
about which nothing is known quantitatively. (Indeed, if $\operatorname{cov}(v_1, v_2) \neq 0$
we may always regard $v_2$ as generated by another linear model
$v_2 = \rho v_1 + \tilde{v}_2$, with $\rho$ fixed by the covariance between $v_1$ and $v_2$ and
$\tilde{v}_2$ independent of $v_1$.) This is a highly unnatural assumption if the
exercise is to be relevant to the real world. Hence in this respect the
KOOPMANS-REIERSØL example must be modified by insisting on $\operatorname{cov}(v_1, v_2) = 0$.

The  nonuniqueness  of  the  solution  of  this  realization  problem  is  unfor¬ 
tunately  a  fact  of  life  and  has  been  observed  in  many  contexts.  (The  inter¬ 
esting  discussion  of  WOLD  [1972]  concerning  the  need  for  causality  assump¬ 
tions  in  addition  to  statistical  techniques  in  the  treatment  of  regression 
is  highly  enlightening  here.) 

The parameter $\alpha$ plays an uninteresting role in the preceding analysis
since the treatment of the mean is essentially a deterministic linear system
problem. Notice also that the assumption $E(v_1) = E(v_2) = 0$, which is made
by KOOPMANS and REIERSØL, plays a role similar to the assumption
$\operatorname{cov}(v_1, v_2) = 0$, which was not made by them. If we didn't assume
$E(v_1) = E(v_2) = 0$ a part of $\alpha$ would be explained by the model (4.1) and
another part by the noise.

Case (B). The noise $(v_1, v_2)$ is not gaussian. In this case, it can
be shown that $(\alpha, \beta)$ are identifiable. (For the general analysis of the
problem, see REIERSØL [1950].) System-theoretically this statement, too,
requires discussion. If the joint distribution of $(z_1, z_2)$ is not gaussian
then, roughly speaking, the cause-and-effect relation between them is not
linear. Consequently this result of REIERSØL must be viewed as an isolated
glimpse at nonlinear realization theory. Having made such an odd assumption
about the noise it is mandatory to explain why the implicit nonlinear
dependence between $z_1$ and $z_2$ should be modeled linearly by (4.1).
Since nonlinear realization theory is not yet a developed subject, it
is not possible at present to give a deductive discussion here in the style
of Case (A).

To the physicist, the KOOPMANS-REIERSØL example amounts to having to
estimate the values of two resistors connected in series such that only
the sum of resistances is available for measurement.

In summary, we may say:

(i) The KOOPMANS-REIERSØL example is a system-theoretic problem.

(ii) It is not well defined unless $c = \operatorname{cov}(v_1, v_2) = 0$, for otherwise
the causal effects are arbitrarily divided between a model to be determined
and noise about which nothing is assumed or observable.

(iii) With the normality assumptions and $c = 0$, we have a well-defined
(elementary) problem in linear stochastic realization theory.

(iv) This problem, unfortunately, has a nonunique solution, which is
typical of stochastic realization problems. This conclusion agrees with
the intuitive judgment that such formulations are acceptable "provided the
temptation to specify models in such a way as to produce identifiability of
relevant characteristics [i.e., parameters] is resisted" (from KOOPMANS and
REIERSØL [1950, Sect. 2.^, p. 169]).

(v) REIERSØL's analysis of the case $z \neq$ gaussian will be a benchmark
of nonlinear realization theory when the latter comes into being.

(vi) MEHRA [1976] shows that the nonuniqueness of the stochastic
realization problem posed by KOOPMANS and REIERSØL (with $\operatorname{cov}(v_1, v_2) = 0$ being
explicitly assumed by MEHRA) can be removed by reformulating the problem.
MEHRA assumes that both $u$ and $v_1$ are random sequences, the first having
correlation properties and the second being white. Then it is possible to
determine $E(u^2)$ and $E(v_1^2)$ separately, not just their sum $E(z_1^2)$, which
makes the solution unique.

This is a much more realistic formulation of the problem; it recognizes
the distinction between the causal variable (the sequence $u$) and the noise
(the sequence $v_1$), while KOOPMANS and REIERSØL grant $u$ and $v_1$ [...]
substitute for causality hypotheses in any statistical identification problem
since statistics is neutral about what causes what.


5. THE "ARMA" MODEL


The widespread use in econometrics of ARMA models (see BOX and JENKINS
[1970]) raises system-theoretic problems which require discussion. We can
now amplify the comments made in Section 3.

The general model of this type is often described in econometrics in
the following terms (see, for example, DEISTLER [1978]):

(5.1) $\sum_{r=0}^{n_1} Q_r y_{t-r} = \sum_{s=0}^{n_2} N_s u_{t-s} + v_t$.


The vector variables $y_t$ and $u_t$ have the same intuitive significance
as in the example of Section 3; the vectors $v_t$ are additional error terms.

We shall consider only the deterministic aspects of the problem. They
revolve around the question: In what sense does (5.1) determine a linear
system?

(1) Equations (5.1) describe a system in the external sense; there are
no state variables.

(2) For the output sequence $y_t$ to be determined from an input sequence
$u_t$ we must have

(5.2) $\det Q(z) \neq 0$

(as a polynomial), where $Q(z)$ is the matrix polynomial

(5.3) $Q(z) = \sum_{r=0}^{n_1} Q_r z^{r}$.

(DEISTLER assumes $\det Q_0 \neq 0$; this is unnecessarily restrictive.)

(3) Now we can write $Q^{-1}(z)N(z)$, with $N(z)$ the matrix polynomial

(5.4) $N(z) = \sum_{s=0}^{n_2} N_s z^{s}$.

$Q^{-1}(z)N(z)$ is a rational matrix and therefore it has a formal Laurent
series about $z = \infty$. To be able to relate this series to the transfer
function of the underlying system it is necessary, for reasons of
causality and normalization, that

(5.5) $Q^{-1}(z)N(z)$ = proper rational matrix.

(DEISTLER does not discuss this point.)

(4) The last assumption means that

(5.6) $Q^{-1}(z)N(z) = \sum_{t>0} A_t z^{-t}$,

where = holds in the sense of formal power series. We are now in the
situation of having given a standard external description $S = (A_1, A_2, \ldots)$
of the system in the form (2.4). Of course, $S$ is "identifiable", by
definition, since this is the data to which the identification problem is
ultimately referenced.

(5) DEISTLER refers to "conditions" for the identifiability of (5.1).
Presumably he means thereby relations between the descriptive parameters
of Q(z), N(z), which are the matrices $Q_0, Q_1, \ldots, Q_{n_Q}$;
$N_0, \ldots, N_{n_N}$, and the descriptive parameters of S, which are the
matrices $A_1, A_2, \ldots$ . However, one cannot consider such things until
after the system is well defined. This obviously requires introducing the
equivalence relation

(5.7)    $(Q(z), N(z)) \sim (\hat Q(z), \hat N(z))$

defined by

(5.8)    $Q^{-1}(z)N(z) = \hat Q^{-1}(z)\hat N(z)$.

The equivalence relation (5.7) is needed to prevent the system from being
ill-defined because of possible cancelable factors between Q(z) and
N(z).
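
The expansion (5.6) and the role of cancelable factors in (5.7)-(5.8) can be checked by hand in the scalar case. The following is a minimal sketch under stated assumptions: Q(z) and N(z) are scalar polynomials with exact rational coefficients, the quotient is strictly proper, and the function name `markov_parameters` and the lowest-power-first coefficient convention are hypothetical choices made only for this illustration.

```python
from fractions import Fraction


def markov_parameters(q, n, count):
    """Return a_1..a_count with n(z)/q(z) = sum_{t>=1} a_t * z**(-t).

    q and n are coefficient lists, lowest power first (illustrative
    convention): q(z) = q[0] + q[1]*z + ... + q[d]*z**d with q[d] != 0.
    """
    d = len(q) - 1
    if q[d] == 0:
        raise ValueError("leading coefficient of q must be nonzero")
    if len(n) > d:
        raise ValueError("n/q must be strictly proper: deg n < deg q")

    def coeff(p, k):
        # Coefficient of z**k, taken as zero outside the stored range.
        return Fraction(p[k]) if 0 <= k < len(p) else Fraction(0)

    a = []
    for t in range(1, count + 1):
        # Match the coefficient of z**(d - t) on both sides of
        # q(z) * sum_t a_t z**(-t) = n(z) and solve for a_t.
        acc = coeff(n, d - t)
        for i in range(1, t):
            acc -= coeff(q, d - i) * a[t - i - 1]
        a.append(acc / Fraction(q[d]))
    return a


# q(z) = z - 1/2, n(z) = 1
base = markov_parameters([Fraction(-1, 2), 1], [1], 6)

# The same pair multiplied by the cancelable factor (z - 3):
# q(z) = z**2 - (7/2) z + 3/2, n(z) = z - 3
padded = markov_parameters([Fraction(3, 2), Fraction(-7, 2), 1], [-3, 1], 6)

print(base)            # 1, 1/2, 1/4, ... as Fractions
print(base == padded)  # True: both pairs give the same sequence
```

Both pairs produce the same sequence, so the data cannot distinguish them; this is the sense in which the two pairs are equivalent under (5.7).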

(6) Conditions (5.2), (5.5), and the equivalence relation (5.7) are
obviously necessary for the abstract map




(5.9)    $(Q(z), N(z)) \mapsto S$

given by (5.6) to be injective.

To establish that this map is bijective, in other words, that we can
legitimately talk about (5.1) as a well-defined external description of
some underlying system, we must show that there exists an injective map

(5.10)    $S \mapsto (Q(z), N(z))$

satisfying (5.6). This is mathematically nontrivial.

Assuming S has a finite-dimensional realization, realization theory
shows that there is a proper rational matrix Z(z) whose formal power
series agrees with the right-hand side of (5.6). It can be shown further
that every proper rational matrix admits a factorization as
$Z(z) = Q^{-1}(z)N(z)$.

This proves the existence of an injective map (5.10). Then it follows
that the correspondence between S and (Q(z), N(z)) can be made bijective
with the aid of the conditions mentioned above.

(7) The bijective relationship between the two external descriptions
is established abstractly and has nothing to say about the descriptive
parametrizations mentioned under (4). The question of intrinsic
parametrization is left open. For S, the dimension of the corresponding
canonical realization is given by (2.10). It is necessary to prove an
analogous formula for (Q(z), N(z)). Then canonical forms can be derived
for Q(z) and N(z) which exhibit the intrinsic parametrization. This is by
no means a simple matter mathematically; the reader is referred to KALMAN
[1981].

In any case, it is certainly not possible to fix the values $n_Q$ and
$n_N$ (as is done by DEISTLER [1978]) because these quantities do not have
any simple relationship to the underlying system $\Sigma$. In fact, it is
not possible to give a single, "globally valid" formula for the object
(Q(z), N(z)) because its intrinsic (canonical) parametrization requires
knowledge of the so-called Kronecker indices (see KALMAN [1971]). These
are a global property of the underlying $\Sigma$ which govern the places
where the canonical parameters appear in the matrices
$Q_0, \ldots, N_{n_N}$.

Subject to these technical details, however, the question of how to
define a deterministic linear dynamical model (external sense) with the
aid of a pair (Q(z), N(z)) is completely settled at the present time.
(This amounts to assuming $u_t$ = known, $v_t$ = 0.) It should be mentioned
that the mathematical analysis required here is relatively recent even in
system theory and was developed mainly during the last ten years under the
impetus of the book of ROSENBROCK [1970].

(8) When we come to the stochastic case (the probability distributions
of $u_t$, $v_t$ are known but not the actual values), it would seem (to
the writer) that the situation is not yet clear, in spite of much recent
work. In particular, it is not clear under what general conditions, if any,
the stochastic realization problem admits a unique solution.

This discussion again shows that "parameter identifiability" is a
nonproblem for the model formalized in terms of (5.1).

The ARMA scheme defines a generic linear system. This is undoubtedly
a major reason for the success of methods based on it. If either the
"AR" or the "MA" is dropped, in other words, if only moving-average or
autoregressive models are considered, genericity is lost and further
serious conceptual difficulties arise.


6.  MORE  PITFALLS

"Parameter identifiability" is, intuitively, a rather appealing notion.
Why did it fail for linear systems? The reason is simply that the
development of a particular field of science (system theory) has reached a
point where results from a subfield (realization theory) are available to
subject the intuitive notion of "parameter identifiability" to rigorous
scientific analysis. When this is done, "parameter identifiability", after
having been tested on the precise and concrete case of a linear system,
collapses as a workable concept; and yet it must work for linear systems
if it has any theoretic merit at all.

A similar application of system theory can be made to assess the merits
of the moving-average (MA) and autoregressive (AR) types of models, which
were proposed by time-series analysts long before system theory existed
(certainly by the late 1920's). The study of these very models provided
some of the stimuli for the development of system theory.

Are these models good or bad? Any good system theorist would immediately
reply, "bad". The justification of this emotional conclusion by rigorous
methods, however, is not at all trivial; in fact, it was made possible only
by the recent development of the so-called "partial realization" theory
(see KALMAN [1971, 1979c]).

The partial realization problem arises when S in (2.4) is given only
partially, that is, by a finite sequence of matrices $A_1, \ldots, A_t$.
Then the realization of minimal dimension $n_t$ may not be unique, but of
course $n_t$ is unique because of the requirement of minimality. The
analysis of $n_t$ as a function of t provides very important mathematical
information concerning the classical realization problem (see KALMAN
[1979c]). Since $n_t$ is a monotone, nondecreasing, integer-valued function
on the integers, its value can change only in "jumps". The structure of
these jumps turns out to satisfy strong regularity conditions (see below).
The generic case occurs when all jumps are equal to one and the jumps occur
at all odd values of t.
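
The jump behaviour of the minimal dimension can be explored numerically in the scalar case. The following is a minimal sketch and not the method of KALMAN [1979c]: it assumes the rank formula $n_t = \sum_{j=1}^{t} \mathrm{rank}\, H_{j,\,t+1-j} - \sum_{j=1}^{t-1} \mathrm{rank}\, H_{j,\,t-j}$ for the minimal partial realization dimension, where $H_{i,j}$ is the i-by-j Hankel matrix built from the data. That formula is taken here as an assumption from the partial realization literature; it is not stated in this paper. All function names are hypothetical, and exact rational arithmetic is used so that ranks are not affected by rounding.

```python
from fractions import Fraction


def rank(rows):
    """Exact rank of a matrix (list of rows) by Gaussian elimination."""
    m = [[Fraction(x) for x in row] for row in rows]
    r = 0
    ncols = len(m[0]) if m else 0
    for c in range(ncols):
        pivot = next((i for i in range(r, len(m)) if m[i][c] != 0), None)
        if pivot is None:
            continue
        m[r], m[pivot] = m[pivot], m[r]
        for i in range(r + 1, len(m)):
            f = m[i][c] / m[r][c]
            m[i] = [x - f * y for x, y in zip(m[i], m[r])]
        r += 1
    return r


def hankel(a, i, j):
    """i-by-j Hankel matrix whose (k, l) entry is a[k + l] (0-indexed data)."""
    return [[a[k + l] for l in range(j)] for k in range(i)]


def minimal_dimension(a, t):
    """n_t for the partial sequence a_1..a_t, using the assumed rank formula."""
    up = sum(rank(hankel(a, j, t + 1 - j)) for j in range(1, t + 1))
    down = sum(rank(hankel(a, j, t - j)) for j in range(1, t))
    return up - down


def jump_pattern(a):
    """Return ([n_1, ..., n_T], [(t, jump size), ...]) for the data a."""
    dims = [minimal_dimension(a, t) for t in range(1, len(a) + 1)]
    jumps = [(t, n - prev)
             for t, (prev, n) in enumerate(zip([0] + dims, dims), start=1)
             if n > prev]
    return dims, jumps


# "Generic" data: unit jumps at t = 1, 3, 5
print(jump_pattern([1, 2, 5, 3, 7, 4]))
# Nongeneric data: a single jump of size 3 at t = 3
print(jump_pattern([0, 0, 1]))
```

For the first sequence the computed dimensions are 1, 1, 2, 2, 3, 3, with unit jumps at t = 1, 3, 5, which is the generic pattern. The sequence 0, 0, 1 produces a single jump of size 3 at t = 3.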

The AR and the MA schemes constitute a constructive existence proof
that for every t the partial realization problem has a (finite) solution.
Mathematically, this is a very trivial fact. It follows, unfortunately,
that this is the only system-theoretic idea inherent in AR and MA.

To be precise, the AR scheme applies if and only if $n_t$ is a function
with a single jump. This is a highly nongeneric situation. Consequently
it usually does not occur in nature. Since the "jumps" are the basic
phenomena in the realization problem, it must be possible to find a
statistical test for the occurrence of various patterns of jumps.

Such a test has not yet been developed (to the writer's knowledge).
Consequently, the application of AR models to real data is
system-theoretic nonsense. When an AR scheme is applied, usually it is out
of pure prejudice, without statistical evidence that the very unlikely case
is in fact at hand. It cannot be argued that AR fits the data, because ARMA
will fit the data even better since ARMA is generic (see Section 5).

It is an amusing fact that for n = 1 it is trivially true that
AR = ARMA. Consequently there is nothing wrong in applying first-order
autoregression; it is when we attempt to jump to the n-th order case that
the idea collapses. Generalizations of this type frequently lead to a
system-theoretic problem which is far more difficult than it seems.

Similar  comments  apply  of  course  also  to  the  MA  scheme. 

Another application of partial realization theory concerns the
"parameters" of the data $S = (A_1, A_2, \ldots)$. Let us take the scalar
case $S = (a_1, a_2, \ldots)$, where the $a_t$ are real numbers, since the
theory given in KALMAN [1979c] is restricted to this case. Such a scalar
sequence may arise, for example, as a discrete-time autocovariance function.

The theory in KALMAN [1979c] applies to any such sequence (there are no
conditions). Consequently any "data" $a_1, a_2, \ldots$ has a certain
pattern of "jumps" associated with it. If a jump of size $q_i$ occurs at
the time point $t_i$, the following statements can be made:

(i) The value of $a_{t_i} \neq \hat a_{t_i}$ := value of $a_{t_i}$ computed
from the (unique) partial realization based on $a_1, \ldots, a_{t_i - 1}$
using formulas (2.5). (This statement is meaningful because the main
theorem of partial realization theory guarantees that "there can be no jump
before uniqueness".) Thus $a_{t_i}$ is not an arbitrary parameter because
otherwise there is a contradiction to the inherent jump pattern.

(ii) After a jump of size $q_i$, exactly $q_i$ elements
$a_{t_i + 1}, \ldots, a_{t_i + q_i}$ of the sequence are "free"; that is,
any values of these parameters may occur without contradicting the jump
pattern.

(iii) Before a jump of size $q_i$, exactly $q_i - 1$ elements of the
sequence are fixed, that is, they are uniquely determined via (2.5) by the
partial realization based on the first $2 n_{t_{i-1}}$ elements of the
sequence.
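
A small worked instance of statements (i) and (ii) in the generic situation may help. It is only a sketch: it assumes that formulas (2.5), which are not reproduced in this section, take the usual scalar form $a_t = h f^{t-1} g$, and the symbols $h$, $f$, $g$ are introduced purely for this illustration.

```latex
\begin{align*}
a_1 \neq 0 \;&\Longrightarrow\; n_1 = 1,\quad t_1 = 1,\quad q_1 = 1
   && \text{(first jump)}\\
a_2 \;&\text{arbitrary}
   && \text{(type (ii): one free element)}\\
hg = a_1,\; hfg = a_2 \;&\Longrightarrow\; f = a_2/a_1,\quad
   \hat a_3 = h f^2 g = a_2^2/a_1
   && \text{(unique realization based on } a_1, a_2)\\
a_3 \neq \hat a_3 \;&\Longleftrightarrow\;
   \det\begin{pmatrix} a_1 & a_2\\ a_2 & a_3 \end{pmatrix}
   = a_1 a_3 - a_2^2 \neq 0
   && \text{(type (i): jump at } t_2 = 3)
\end{align*}
```

Since both jumps have size one, $q_i - 1 = 0$ and no type (iii) elements appear in this example.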


This shows that it is very naive to speak of $S = (a_1, a_2, \ldots)$ as a
sequence of (intrinsic) parameters, if we take parameter in the usual sense
of being any (real) number. Only the $q_i$ elements of the sequence
following the i-th jump point (type (ii)) qualify as parameters in this
sense. The type (i) elements must satisfy an $\neq$ condition and type
(iii) elements (which do not occur in the generic case) are completely
fixed. Moreover, and this is the crucial point, the location of the various
types of elements is fixed by the jump pattern, which, as an intrinsic
property of the data, has not been discovered in the time-series literature
prior to KALMAN [1971, 1979c].

Thus we see that at present even the preliminary question of
parametrization of the data raises deep-seated theoretical questions.


7.  CONCLUSIONS 

System  theory  is  a  new  paradigm.  It  applies  to  economics  in  at  least 
two  ways:  through  the  study  of  systemic  properties  of  any  economic  model 
and  through  the  critique  of  the  econometric  recipes  of  modeling.  We  have 
studied  here  the  second  aspect. 

The development of science, certainly in regard to economics, has now
reached the stage where Newton has become a very poor role model. The
Newtonian approach (isolate a phenomenon and try to analyze it in its
simplest appearance without regard to the context) is precisely what is
inapplicable to the problems of economics, because economic phenomena are
intrinsically system (context) related.

The aspiration of econometrics, to develop hidden quantitative
relationships from interrelated data, is a problem which belongs to the
domain of system theory. To be successful, this program must be
supplemented by close attention to more advanced problems arising from
modeling, for example, the question of intrinsic parameters and their
determination from real data.

It is certainly not enough to think in terms of parameters in the naive
sense. In particular, we have shown that the intuitive notion of
"identifiability of a parameter" cannot be developed into a meaningful
scientific concept and must be replaced by the apparatus of realization
theory.




Let us close by recalling the position taken twenty-five years ago by
J. von NEUMANN [1955] in one of his last public statements. In response
to a discussion topic concerning the potential scientific development of
economics, he denied that progress would be impeded because of the
"impossibility of experiments" (classical astronomy, a successful science
without experiments, being his counterexample) or hindered by "lack of
data" (many scientific advances, such as EINSTEIN's photoelectric law and
even more his general relativity, were conceived when there was very little
available data). The most important missing element in economics, von
NEUMANN insisted, was the "definition of categories".

What he meant by this is articulated in contemporary terminology by the
words "invariants", "structure", "decomposition", "elements", etc. If
modeling of economic time series is to have relevance for economic theory
(and in that hope we are all united), then system theory must be able to
dig up von NEUMANN's missing categories by penetrating more deeply into the
theory of models.

There are many results and much research in system theory today concerned
with exactly the same problem. Econometrics must also contribute to its
solution, or wither as an irrelevant exercise in statistics.

The task of going from real data to economic (or any other) theory may
be attacked in many ways. The scientific approach is not compulsory.
Astrology has been tried. No optimist would quarrel with the declaration of
one of von NEUMANN's direct successors that "exposure to the 'real world'
of economic policy...affirmed rather than eroded [my] belief in the
usefulness and relevance of economic theory" (see WHITMAN [1979, page x]).
Innocent faith sometimes moves mountains. But don't hold your breath. It is
better to sit down and to begin by cleansing the field of misconceptions.


REFERENCES

H. AKAIKE
[1974]   "Stochastic theory of minimal realization", IEEE Transactions
         on Automatic Control, AC-19: 667-674.

G. E. P. BOX and G. M. JENKINS
[1976]   Time Series Analysis: Forecasting and Control, revised
         edition, Holden-Day, 575 pages.

M. DEISTLER
[1978]   "The structural identifiability of linear models with
         autocorrelated errors in the case of cross-equation
         restrictions", Journal of Econometrics, 8: 23-31.

P. FAURRE
[1973]   "Réalisations markoviennes de processus stationnaires",
         Research Report No. 13, IRIA, Rocquencourt, FRANCE.

P. FAURRE, M. CLERGET, and F. GERMAIN
[1979]   Opérateurs Rationnels Positifs: Application à l'Hyperstabilité
         et aux Processus Aléatoires, Dunod, 294 pages.

E. J. HANNAN
[1971]   "The identification problem for multiple-equation systems with
         moving average errors", Econometrica, 39: 751-765.

R. E. KALMAN
[1960]   "A new approach to linear filtering and prediction problems",
         Journal of Basic Engineering (Trans. ASME), 82 D: 35-45.

[1965]   "Linear stochastic filtering theory - reappraisal and outlook",
         in Proc. Symposium on System Theory, edited by J. Fox,
         Polytechnic Institute of Brooklyn, pages 197-205.

[1969]   Topics in Mathematical System Theory, McGraw-Hill, 358 pages.

[1971]   "On minimal partial realizations of a linear input/output map",
         in Aspects of Network and System Theory (a collection of papers
         in honor of E. A. Guillemin), edited by R. E. Kalman and N.
         DeClaris, Holt, Rinehart, and Winston, pages 385-408.

[1972]   "Kronecker invariants and feedback", in Proc. 1971 NRL-MRC
         Conference on Ordinary Differential Equations, edited by
         L. Weiss, Academic Press, pages 459-471.

[1976]   "Realization theory of linear dynamical systems", in Control
         Theory and Functional Analysis, Vol. II, International Atomic
         Energy Agency, Vienna, 1976, pages 235-250.

[1978]   "A retrospective after twenty years: from the pure to the
         applied", in Applications of Kalman Filter to Hydrology,
         Hydraulics, and Water Resources, edited by Chao-Lin Chiu,
         Department of Civil Engineering, University of Pittsburgh,
         pages [illegible].

[1979a]  "A system-theoretic critique of dynamic economic models", in
         Global and Large-Scale System Models, edited by B. Lazarevic,
         Springer, pages 1-24.

[1979b]  "Theory of modeling", in Proceedings of the IBM System Science
         Symposium, Oiso, JAPAN, edited by Y. Nishikawa, pages
         53-[illegible].

[1979c]  "On partial realizations, transfer functions, and canonical
         forms", in Acta Polytechnica Scandinavica, Mathematics and
         Computer Science Series No. 31, pages 9-32.

[1980a]  "System-theoretic critique of dynamic economic models",
         International Journal of Policy Analysis and Information
         Systems, ...

[1980b]  "Dynamic econometric models: a system-theoretic critique", in
         New Quantitative Techniques for Economic Analysis, edited by
         A. [illegible] and G. P. Szegö, Academic Press, pages ...

[1981]   "Unitary models for time series and the identification of
         dynamics", in Developments in Statistics (book), edited by
         P. R. Krishnaiah, Academic Press, pages ...

T. C. KOOPMANS and O. REIERSØL
[1950]   "The identification of structural characteristics", Annals of
         Mathematical Statistics, 21: 165-181.

R. K. MEHRA
[1976]   "Identification and estimation of the error-in-variables
         model (EVM) in structural form", in Mathematical Programming
         Study (North Holland) vol. 5, pages 191-210.

J. von NEUMANN
[1955]   "The impact of recent developments in science on the economy
         and on economics", summary of speech before the National
         Planning Association, Washington, DC; in Looking Ahead, 4: 11;
         reprinted in the author's Collected Works, Vol. VI, pages
         100-101.

G. PICCI
[1976]   "Stochastic realization of gaussian processes", Proceedings of
         the IEEE, 64: 112-122.

[1977]   "Some connections between the theory of sufficient statistics
         and the identifiability problem", SIAM Journal on Applied
         Mathematics, 33: 383-398.

O. REIERSØL
[1950]   "Identifiability of a linear relation between variables which
         are subject to error", Econometrica, 18: 375-389.

J. RISSANEN and T. KAILATH
[1972]   "Partial realization of random systems", Automatica, 8:
         389-396.

H. H. ROSENBROCK
[1970]   State-Space and Multivariable Theory, Wiley, 257 pages.

P. SCHÖNFELD
[1979]   "Identification" (in German), in Encyclopedic Handbook of the
         Mathematical Economic Sciences, edited by M. Beckmann,
         G. Menges and R. Selten, ... pages 61-66.

A. TANNENBAUM
[1981]   Invariance and System Theory: Algebraic and Geometric Aspects,
         Springer Lecture Notes in Mathematics No. 845, ... pages.

C. VAN PUTTEN and J. H. VAN SCHUPPEN
[1979]   "On stochastic dynamical systems", Proceedings 4th
         International Symposium on the Mathematical Theory of Networks
         and Systems, July 1979, Delft, NETHERLANDS.

M. v. N. WHITMAN
[1979]   Reflections of Interdependence: Issues for Economic Theory
         and U.S. Policy, University of Pittsburgh Press, 318 pages.

H. WOLD
[197?]   ... Synthese,


